Colleyville
PathM3: A Multimodal Multi-Task Multiple Instance Learning Framework for Whole Slide Image Classification and Captioning
Zhou, Qifeng, Zhong, Wenliang, Guo, Yuzhi, Xiao, Michael, Ma, Hehuan, Huang, Junzhou
In the field of computational histopathology, both whole slide images (WSIs) and diagnostic captions provide valuable insights for making diagnostic decisions. However, aligning WSIs with diagnostic captions presents a significant challenge. This difficulty arises from two main factors: 1) Gigapixel WSIs are unsuitable for direct input into deep learning models, and the redundancy and correlation among the patches demand more attention; and 2) Authentic WSI diagnostic captions are extremely limited, making it difficult to train an effective model. To overcome these obstacles, we present PathM3, a multimodal, multi-task, multiple instance learning (MIL) framework for WSI classification and captioning. PathM3 adapts a query-based transformer to effectively align WSIs with diagnostic captions. Given that histopathology visual patterns are redundantly distributed across WSIs, we aggregate each patch feature with MIL method that considers the correlations among instances. Furthermore, our PathM3 overcomes data scarcity in WSI-level captions by leveraging limited WSI diagnostic caption data in the manner of multi-task joint learning. Extensive experiments with improved classification accuracy and caption generation demonstrate the effectiveness of our method on both WSI classification and captioning task.
- North America > United States > Texas > Tarrant County > Colleyville (0.04)
- North America > United States > Texas > Tarrant County > Arlington (0.04)
- Health & Medicine > Therapeutic Area > Oncology (0.49)
- Health & Medicine > Diagnostic Medicine > Imaging (0.47)
Perspective: The risk that AI poses to religious freedom
We frequently hear in the 21st century that data is the new oil. Those who controlled oil flows in the 1970s had a near stranglehold on the global economy. Today, those who hold data might well control the new economy. Data, however, is diffuse, hard to track and nearly impossible to regulate, which could have unparalleled implications for human rights and religious freedom. Big data companies have poured billions into research to bring technology and data into direct contact with us every day through artificial intelligence.
- South America > Venezuela (0.18)
- Asia > Middle East > Iran (0.17)
- North America > United States > Texas > Tarrant County > Colleyville (0.05)
- (4 more...)
- Media > News (1.00)
- Government > Regional Government > North America Government > United States Government (0.32)
- Government > Regional Government > Asia Government (0.31)